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(57) Abstract 

The present invention is a collaborative server system for 
providing high speed data transmission of data files across a 
communications network which comprises a communications 
network, a primary server (14) having a primary communications 
component (36) for connecting the primary server (14) to the 
network and at least one secondary server (16) having secondary 
communications component (52) for connecting the secondary 
server to the network. The primary and secondary servers (14, 
16) include storage component for storing data files (40, 42). 
The data files include static data files (42) and/or dynamic data 
file (40). The storage component of primary server (14) stores 
at least one look-up table having specific criteria pertaining to 
the data files and the primary and secondary servers (14, 16). 
The processor of the primary server (14) is effective to receive 
a request for specific data files from a network user to look-up 
specific criteria in the look-up table. 
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WEB SERVING SYSTEM WITH PRIMARY AND SECONDARY SERVERS 

The present irfvention relates generally to a network of computer systems 
that transfer data files through a network connection, some of which are 
5 repeatedly transferred without changes ("static files"), and some of which may be 
modified for each transfer ("dynamic files"). More particularly, the present 
invention relates to web servers that are connected to a TCP/IP-capable network 
and transfer data files of the World Wide Web to computer systems connected to 
the network. In particular, the present invention is a web serving system that 
1 0 efficiently coordinates the efforts of two or more web servers to transfer data files 
rapidly through the TCP/IP-capable network while minimizing the cost of such 
system. 

15 BACKGROUND OF THE INVENTION 

A web serving system or web server is a computer system that runs web 
server software and connects to a communications network, such as the Internet. 
A publisher may use the web serving system to offer various data files for public 
access, including text, graphics, audio, video, and other types of data files. Thus, 
20 anyone having a computer that is connected to the same network may retrieve 
these data files offered by the publisher through the use of a standardized 
communications protocol. One protocol, namely the Hyper Text Transfer 
Protocol ("HTTP"), is commonly used on TCP/IP-capable networks to transfer 
data files of the World Wide Web. 

25 

Data files of the World Wide Web are transferred from a web server to a 
web browser computer via a TCP/IP-capable network. The web browser 
computer typically consists of a computer, web browser software running on the 
computer, and communications hardware to connect the computer to the network 
30 through a communications link. Likewise, the web server typically consists of a 
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computer, web server software running on the computer, storage hardware for 
storing the web pages, and communications hardware to connect the computer to 
the network through a communications link. A wide variety of communications 
links are available for connecting a computer to a TCP/IP-capable network, 
5 including an analog telephone connection, 56K line connection, ISDN connection, 
fractional Tl connection, Full Tl connection, cable-modem connection and 
satellite connection. 

The user may view the contents of a specific data file of a particular web 
1 0 server by operating the web browser software. In particular, when a user wishes 
to receive the data file, the user operates the web browser computer so that it 
indicates the network address of the appropriate web server and the name of the 
data file on the server. The web browser computer then establishes a clear 
communications channel between the web server and itself, and requests the target 

1 5 data file. Next, the web server retrieves the specified data file from the web 
server's storage hardware and transmits the contents of the data file through the 
communications network to the web browser computer. The web browser 
computer then creates an audio/visual presentation of the data file for the user by 
activating the various hardware subsystems, such as one or more video monitors 

2 0 and/or audio speakers. 

Web browser software has been enhanced to intermingle several pieces of 
text, graphics, video, audio, and animation together so that browsing or surfing the 
World Wide Web is an interactive and engaging process. In particular, 

2 5 standardized tags or sequences of text characters are included in the data files to 

control the placement of intermingled data files on the pages of the World Wide 
Web. A data file has text data and non-text data in which the standardized tags 
are included with the text data. The static data includes image data, animation 
data, video data, computer programs, and other types of non-text data. A data file 

3 0 containing a mixture of text and standardized tags is referred to as web page. 
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The standardized tags can include additional information, such as the web 
addresses, data file names of other web servers, and data files of graphic images, 
video clips, and audio recordings. Thus, most web browser software 
automatically presents an integrated display of dynamic data and embedded static 
5 data when displaying a webpage to the user. Accordingly, two distinct data files 
are retrieved to create a particular display: the first data file includes the text and 
tags, and the second data file includes the non-text data. 

There are typically over a thousand web browsing users for every web 
1 0 server in operation and, thus, web servers are subjected to numerous requests for 

data files. Most contemporary web servers are capable of processing this volume 

of requests, but the web server's communications link quickly becomes saturated. 

In particular, any given communications link can only transmit a finite number of 

bits per second, and when a web server using all of the available bandwidth on its 
1 5 communications link to transmit data files, no more data files may be transmitted 

through that communications link until the transfers in progress are finished. 

Most communications links cannot sustain more than a dozen concurrent data file 

transfers, and each transfer can take several seconds. 

2 0 Because of the high ratio of web browsers to web servers, it is extremely 

desirable to have a web server that is capable of handling at least ten concurrent 
data file transfers. Data communications technology, unfortunately, has not 
evolved as fast as other computer technologies. The least expensive type of 
communications link is the analog telephone line. However, the extremely 

2 5 constrained capacity of the analog telephone Line makes it unacceptable for most 
webservers. The cost of other higher-speed communications links such as Tl 
lines increases along with their capacity, and they are far more expensive to setup 
and maintain than an analog or even ISDN telephone line. 



WO 98/04985 



PCIYUS97/13097 



t 

On the other hand, because Full Tl lines are fairly cost effective, Internet 
Service Providers ("ISPs 1 ') have created profitable businesses by purchasing a Full 
Tl line, operating a web server, and renting out storage space to clients on their 
web server's storage hardware. These ISPs charge a low monthly rate for renting 
5 storage space on their web servers. Any data file that a client places in the rented 
storage space can be retrieved by web browsers that request them and, thus, clients 
make their pages and images accessible to a larger number of concurrent browsers 
without having to pay for the full cost of the full Tl themselves. 

1 0 However, in order to maintain control and security of their web servers, 

the ISPs usually limit their clients to publishing static pages. Static pages are only 
stored and retrieved and, thus, dynamic and interactive pages are not permitted. 
Dynamic and interactive pages are essentially separate software programs that 
produce pages as their output. Since the software program is executed each time a 

1 5 web browser computer requests the page, the program may produce a different 
page each time, specifically for the web browser that made the request. This 
facility is becoming increasingly popular since it enables the interconnection of 
web server software and other software programs, such as databases, electronic 
commerce systems, e-mail systems, stock quote services, etc. Therefore, in order 

20 to have dynamic and interactive pages, a publisher cannot rent space from an ISP 
but must setup and maintain his or her own custom web server which is often 
prohibitively expensive. 

Accordingly, the web serving system of the present invention has the 
2 5 capability of delivering text, graphics, video, audio and interactive multimedia 
over the Internet's World Wide Web at extremely high speeds, using only 
inexpensive, readily available hardware, software and services. The system 
requires no more effort than alternative systems, yet produces the highest 
performance-to-price ratio of any comparable World Wide Web publishing 
30 system. 
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SUMMARY OF THE INVENTION 

The present invention is a collaborative server system capable of providing 
high speed data transmission of data files across a communications network 
which, in brief summary, comprises a communications network, a primary server 
5 having a primary communications means for connecting the primary server to the 
communications network, and at least one secondary server having a secondary 
communications means for connecting the secondary server to the 
communications network. The primary server and the at least one secondary 
server include means for storing data files and means for transmitting the data files 
10 to the communications network. The data files include static data files and/or 

dynamic data files. The storage means of the primary server further stores at least 
one look-up table having specific criteria pertaining to the data files and the 
primary and at least one secondary servers. The processor means of the primary 
server is effective to receive a request for specific data files from a network user, 
15 to look-up specific criteria in the look-up table pertaining to the specific data files, 
and to allocate transmission of each specific data file between the primary server 
and the at least one secondary server based on the specified criteria. 

The storage means of the primary server includes static data files and 
2 0 dynamic data files, and the storage means of the secondary server includes a 
duplicate of the static data files. Thus, the processor means includes means for 
transferring the duplicate of the static data files from the storage means of the 
primary server to the storage means of the secondary server. Accordingly, the 
processor means of the primary server is capable of providing the dynamic files of 
25 the primary server and the duplicate of the static data files of either the primary 
server or the secondary server to the communications network in response to a 
single request for the static data files and dynamic data files of the primary server. 

In addition, it is preferred that the secondary communications means of the 
30 secondary server transmits and receives communications signals to and from 
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certain user's computers at faster rates, with lower latency, more cost-effectively, 
or more efficiently than that of the primary communications means of the primary 
server. In particular, the primary communications means of the primary server is a 
lower-cost connection to the communications network (i.e., an analog 
5 communication line), and the secondary communication means is a connection 
with better bandwidth, latency, cost-effectiveness, and efficiency (i.e., a digital Tl 
communication line). 

Further, the system includes at least one computer system that is connected 
10 to the communications network and has the capability of sending a request to the 
primary server. In addition, the processor means of the primary server includes 
means for determining an optimum server from the group of servers including the 
primary and secondary server to transmit the duplicate of the static data files to the 
computer system when two or more servers are available. The criteria for 
1 5 determining which one of the servers shall transmit includes transmission speed 
and available capacity of the primary communications means, proximity of the 
computer system to each server, availability of each server, version of the 
duplicate of the data file on each server and financial cost of transmitting data 
from each server. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and still further objects and advantages of the present 
invention will be more apparent from the following detailed explanation of the 
2 5 preferred embodiments of the invention in connection with the accompanying 
drawings: 

Fig. 1 is a diagrammatic view of the preferred web serving system in 
accordance with the present invention; 

30 



WO 98/04985 



PCT/US97/13097 



10 



Fig. 2. is a flow diagram representing the operation of the primary web 
server's computer when used with the supplemental web server software of Fig. 1 ; 

Fig. 3 is a flow diagram of the DEMAND MO subroutine of Fig. 2; 

Fig. 4 is a flow diagram of the BEST SI subroutine of Fig. 2; 

Fig. 5 is a flow diagram of the REWRITE subroutine of Fig. 2; 

Fig. 6 is a preferred flow diagram of the REWRITE subroutine of Fig. 2; 



15 



and 



Fig. 7 is a flow diagram of the POSTPROCESS subroutine of Fig. 2. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring to the drawings, and in particular, to Fig. 1, there is provided a 
web serving system of the preferred embodiment which is generally represented 
by reference numeral 1 0. Web serving system 10 has the capability of delivering 

20 text, graphics, audio recordings, video recordings, computer animation, and 
computer programs and data to remote computer systems and storage devices 
through a distributed data network at extremely high speeds, often in excess of 
300,000 bits per second, in a way that is interoperable with the World Wide Web's 
hypertext protocol. In particular, web serving system 10 combines a low-cost 

2 5 communications channel of a primary server with a more desirable 

communications channel of one or more secondary servers as well as computer 
storage and retrieval services for both the low speed and high speed channels. By 
coordinating data retrieval from the various servers, the primary server is able to 
control the data files sent by each server so that the speed in which data files are 

30 delivered to a target browser computer is maximized while retaining the ability to 
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provide dynamic and interactive pages (i.e., dynamic data files). In addition, the 
setup and maintenance costs of web serving system 10 are substantially less than 
that of conventional web serving systems since its secondary web server takes 
advantage of the low cost of renting or otherwise using storage space in existing 
5 servers and its primary web server takes advantage of lower-cost, lower- 
bandwidth communications links including analog telephone lines and lower- 
speed digital connections. 

When used with an analog telephone line, preferred web serving system 10 
10 shown in Fig. 1 is capable of fulfilling requests for over 100,000 data files per day 
at a minimal cost. The available capacity of the system is dependent on a number 
of factors and conditions, but the system can reliably sustain performance in the 
400Kbps range and process many transfers concurrently. Also, the monthly cost 
per concurrent connection is much lower than that of a conventional web serving 
15 system that uses a 56K line connection, ISDN connection, fractional Tl 
connection, or Full Tl connection. 

As shown in Fig. 1, preferred web serving system 10 includes one or more 
web browser computer systems 12, a primary web server 14 and one or more local 

2 0 web browser computer system 1 5 connected on a local area network (LAN) 35, 
one or more secondary web servers 1 6 and one or more satellite web browser 
computer system 15 connected on a LAN 51, and a communications network 18. 
Web browser computer system 12 may be a web user connected on a local area 
network (LAN) 35 with primary web server 14 (local user), connected on a LAN 

25 51 with secondary web server 1 6 (satellite user) or not connected to either LAN 35 
or LAN 51 (remote user). 

Each web browser computer system 12 includes a user's computer 20, web 
browser software 22 residing in computer 20, communications hardware 24 
30 connected to computer 20, and communications link 26 connecting 
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communications hardware 24 to communications network 1 8. In particular, user's 
computer 20 may be any type of computer system that is capable of running user's 
web browser software 22 and connecting to communications network 18. For 
example, user's computer 20 may be a personal computer or workstation that 
5 includes a memory portion for storing user's web browser software 22 and a 
processor for executing web browser software 22 so that the user may interact 
with software 22. Also, user's computer 20 is connected to user's communications 
hardware 24 which, in turn, is linked to communications network 1 8 by user's 
communications link 26. For example, a modem may be used to link user's 
10 computer 20 to an Internet Service Provider ("ISP") through a telephone line. 

Since a wide variety of different computer systems may connect to the network, it 
is to be understood that user's computer 20 of the present invention is not limited 
to the personal computer or workstation examples described above. Note that web 
browser computer systems 1 2 which are connected to either LAN 35 or LAN 5 1 
1 5 respectively employ the same communications hardware and link as primary 
server system 14 and secondary server system 16. 

Primary web server 14 includes a primary computer 28, storage hardware 
30 connected to computer 28, web server software 32 and supplemental web 
20 server software 34 residing in computer 28, communications hardware 36 

connected to computer 28, and communications link 38 providing a connection 
between communications hardware 36 and communications network 1 8. Also, 
primary storage hardware 30 may reside internal or external to primary computer 
28 and is capable of storing primary data files, such as original dynamic data files 

25 40 and original static data files 42. Dynamic data files 40 include standardized 
tags as well as text data, whereas static data files 42 typically include non-text 
data, such as image data, animation data, video data, audio data, and computer 
programs. Primary data files (40, 42) are commonly transmitted over packet- 
switched communications network 1 8 using TCP/IP protocols and read by user's 

3 0 web browser software 22. Primary computer 28 may be any type of computer 
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system that is capable of running primary web server software 32 and 
supplemental web server software 34, and connecting to communications network 
18/- A wide variety of computer systems, ranging from mainframe computer to 
personal computers, are capable of running web server software 32 with 

5 supplemental web server software 34 and, thus, primary computer 28 is affordable 
and readily available. For example, primary computer 28 may be a personal 
computer or workstation that includes a memory portion for storing primary web 
server software 32 and supplemental web server software 34, and a processor for 
executing both software. Also, similar to web browser computer system 12, 

10 primary computer 28 is connected to primary communications hardware 36 which, 
in turn, is linked to communications network 1 8 by primary communications link 
38. 

In addition, the present invention includes one or more secondary web 
15 servers 16. Although a secondary web server 1 6 may be purchased and setup for 
the present invention, the services of such secondary web server 16 is readily 
available to avoid their high setup and maintenance costs, which is a feature of the 
present invention. For example, one may rent space on a secondary web server 
from an Internet Service Provider for a low fee, or arrange to use space on a 
2 0 secondary web server owned by an organization or university. 

Typically, each secondary web server 16 includes a secondary computer 
44, storage hardware 46 connected external or provided internal to secondary 
computer 44, web server software 48 and remote storage access software 50 

2 5 residing in secondary computer 44, communications hardware 52 connected to 

secondary computer 44, and communications link 54 connecting secondary 
computer 44 to communications network 18. Secondary storage hardware 46 is 
capable of storing data files 56 that are duplicates of the original data files (40, 42) 
in primary storage hardware 30 of primary web server 14. Also, secondary 

3 0 communications link 54 has superior operating characteristics than that of primary 
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communications link 38. For example, as shown in Fig. 1, secondary 
communications link 54 is a Full Tl line that has higher performance capacity 
than primary communications link 38 which is an analog telephone line. Also, 
similar to primary computer 28, secondary computer 44 may be any type of 
5 computer system, ranging from personal computer to mainframe computers, that 
is capable of running secondary web server software 48 and remote storage access 
software 50, and connecting to communications network 1 8. Further secondary 
computer 44 is connected to secondary communications hardware 52 which, in 
turn, is linked to communications network 18 by secondary communications link 
10 54. 

Computer 28 of the primary web server 14 executes supplemental web 
server software 34 to perform the critical function of coordinating file transfers by 
primary web server 14 and secondary web server or servers 1 6. Computer 28 and 

15 supplemental web server software 34 exploit the fact that the ultimate audience, 
namely the users who accessing primary web server 14, for data files (40, 42) is 
not involved in the retrieval of each data file individually. In actuality, the task of 
retrieving data files, particularly static data files 42 that are embedded in pages, is 
performed automatically by the user's web browser software, such as web browser 

20 software 22. 

Primary computer 28, when used with supplemental web server software 
34, adds significant and powerful capabilities to primary web server software 32. 
In particular, primary computer 28 identifies the static files (i.e., typically non-text 

2 5 data files) that are consuming most of the capacity of primary communications 
link 38, copies those capacity-consuming static data files 42 to one or more 
secondary web servers 16 (which have higher capacity communications links), and 
then dynamically re-writes requested pages so that the embedded static data files 
are retrieved directly from the best or optimal web server each time. Primary 

30 computer 28 also records every request for a data file (40, 42), builds a statistical 
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model of how the capacity of primary communications link 38 is being consumed 
by the transmission of the various data files (40, 42), determines which static data 
files 42 are consuming the most capacity, uses standardized communication 
protocols to communicate through the channel between primary and secondary 
5 web servers (14, 16), uses the communication channel to copy static data files 42 
from primary storage hardware 30 to secondary storage hardware 46, and tracks 
the progress and location of all duplicated files 56 through the use of a database. 

In addition, primary computer 28 responds to requests for pages stored on 

10 the primary storage hardware 30 by reading the data file (40, 42) from the primary 
storage hardware 30, finding all tags in the data file (40, 42) that refer to static 
data files 42 stored on primary web server 14, examining the database to retrieve a 
list of secondary web servers 16 and data files 56 where instantiation of static data 
file 56 referred to by each tag may be found. Primary computer 28 also responds 

15 to requests for pages stored on primary storage hardware 30 by determining, for 
each static data file 42 referred to by the page's tags, which available instantiation 
would be the best instantiation to transmit to web browser computer system 12. 
Considerations for determining the best instantiation includes the transmission 
speed of each web server's communications link (38, 54), the currently available 

20 capacity of each web server's communications link (38, 54), the proximity of web 
browser computer system 12 to each web server (14, 16) given the topology of the 
communications network 1 8, the current availability or operability of each 
secondary web server 16, the version of the data file 42 available on each web 
server (14, 16) in which only instantiations that are current are eligible, the cost of 

2 5 transmitting the data file from each server, the response latency of each server 

with respect to the requesting computer and the efficiency of transmitting the data 
file from each server. Primary computer 28 further responds to requests for pages 
stored on the primary storage hardware 30 by creating a re-written page, in which 
each tag that refers to a particular data file 42 is replaced with a tag that is 
30 functionally identical except that the tag now refers to the best instantiation rather 
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than to original data file 42. Also, the re- written page is transmitted through the 
primary connections link to web browser computer system 12, statistical 
information about the transmission of the page and the supposed transmission of 
any static data file 42 whose best instantiation is not on primary web server 14 is 
5 recorded, various statistics are used to update the database with revised 

information about the usage levels and patterns of data files (40, 42, 56) on both 
primary web server 14 and secondary web servers 16, and statistical information 
in the database is used to guide primary computer 28 in optimizing of the 
distribution of data file instantiations across multiple servers. 

10 

Once static data files 42 have been copied to one or more secondary web 
servers 1 6, only original dynamic data files 40, particularly their text-and-tags 
information, need to be transferred through primary communications link 38. 
Most static data files 56 that are requested from outside the local area network are 

1 5 transmitted over the higher-speed, higher-capacity, lower-latency or more efficient 
links of secondary communications hardware 52. If one secondary web server 16, 
whose communications link 54 is a Full Tl line, is used, the speed at which the 
primary computer 28 can deliver pages and static data files can be as high as about 
1200Kbps for short bursts and tends to run consistently about 300Kbps to about 

20 400Kbps. Thus, by combining the resources of a low-cost, low-bandwidth 
communications link 38 and some low-cost storage space on one or more 
secondary web servers 16 with high speed, high capacity communications links 
54, the result is a web serving system which is fast, interactive and inexpensive. 

25 It is important to note that the present invention provides high capacity 

performance, reliably sustaining about 400 Kilobits per second of capacity, while 
providing the capability of handling dynamic and interactive pages and benefiting 
from the economical startup and maintenance costs of a low cost, low bandwidth 
communications link. In particular, when used with an analog telephone line, the 

30 present invention is capable of delivering performance that is comparable to a 
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system having a fractional Tl connection and surpasses a system having 56K line 
or ISDN connection. Therefore, for the present invention, an expensive digital 
communications link, such as fractional Tl connection or Full Tl connection, is 
not required for high capacity performance. In fact, the primary communications 
5 link may be an analog telephone line connected to the primary communications 
hardware, such as a modem, and yet provide performance that is comparable to a 
fractional Tl connection and outperform a 56K line or ISDN connection. 

Referring to Fig. 2, there is provided the main flow chart, beginning with 
1 0 step 60, for the process performed by primary computer 28, when using 

supplemental web server software 34, for coordinating the file transfer functions 
of primary web server 14 and secondary web server or servers 16. When a new 
request for a data file (40, 42) is received by web server software 32 as shown in 
step 62, primary computer 28 starts by examining the name of the requested file as 
1 5 using subroutine DEMAND MO shown in step 64. The coordinating means then 
selects the instance with the highest quality that is associated with this master 
object by using subroutine BEST SI as shown in step 66. 

If the best instance is not associated with the primary server, then a redirect 
20 response indicating the location of the instance master object is prepared as shown 
in steps 68 and 70. If the best instance is associated with the primary server but 
the requested data file is not a page that may contain tags or reference to other data 
files, a direct response including the contents of the instance master object is 
prepared as shown in steps 72 and 74. If the best instance is associated with the 

2 5 primary server and the requested data file contains tags or reference to other data 

files, then a revised version of the contents of the instance master object is 
generated using subroutine REWRITE as shown in step 76. Then, as shown in 
step 78, a direct response including the rewritten contents of the instance master 
object is prepared. Next, in all cases, the contents of the instance found on the 

3 0 primary server are transmitted to the web browser that requested it as shown in 
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step 80. Finally, statistical information is recorded using subroutine 
POSTPROCESS and primary computer 28 stops execution as shown in steps 82 
and 84. 

5 Referring to Fig. 3, the subroutine DEMAND-MO (step 64 of Fig. 2) 

begins at step 86 and responds to the request for the data file (40, 42) received by 
web server software 32 as shown in step 88. Primary computer 28 then searches 
the database for a master object record that matches the name given as shown in 
step 90. If there is a matching master object record in the database, then the 

1 0 coordinating means returns the master object record as shown in steps 92 and 94. 
If a matching record is not in the database, a new master object record with the 
name of the given file is created in the database as shown in step 96. Next, a new 
statistics record is created for this master object, and an instance record with the 
name of the given file is created in the instance database as shown in step 98. 

15 Also, the instance record is marked as primary and local and the instance is 
associated with primary web server 14. Finally, as shown in step 94, the new 
master object record is returned and, as shown in step 1 00, subroutine DEMAND 
MO returns to the main process of Fig. 2. 

2 0 Referring to Fig. 4, there is provided the BEST SI subroutine shown as 

step 66 in Fig. 2, starting with step 102. Primary computer 28 initially receives a 
given master object and retrieves the list of instances associated with the given 
master object from the database as shown in steps 104 and 106. As shown in 
steps 1 08, primary computer 28 determines whether there are any more matching 

2 5 records. If no more matching records exist then the primary computer 28 returns 
the instance master object having the lowest serving time and/or highest quality 
score, and the BEST SI subroutine returns to the main process of Fig. 2 as shown 
insteps 110 and 120. 



WO 98/04985 



PCT/US97/13097 



If there are any more matching records, such record is retrieved as shown 
in step 1 12. Then, the serving time is calculated by considering the transmission 
time of the data file from the given instance, given the current load on the server 
for the instance and the load and maximum speed of the communications link (38, 
5 54) for that server (14, 16). The quality score is set to be an inverse function of 
the time required, and scaled by the topological distance between the web browser 
computer system 12 making the request and the server (14, 16) for this instance. 
Cost, latency and efficiency are also considered part of the quality score. If the 
serving time is faster than any other serving time considered so far for this master 

1 0 object, then the instance is designated as having the lowest serving time, and 
primary computer 28 searches for the next record matching the master object in 
the database as shown in steps 1 16 and 118. Otherwise, if the serving time is not 
the fastest one considered, then primary computer 28 simply goes back to 
searching for the next record matching the master object in the database as shown 

15 in steps 116. 

Referring to Fig. 5, there is provided the REWRITE subroutine shown as 
step 76 in Fig. 2, starting with step 200. Initially, the instance record is received, 
and a working copy of the data file of the instance record location is retrieved, as 
20 shown in steps 202 and 204. Primary computer 28 then analyzes the working 
copy of the data file for the next embedded reference to a data file as shown in 
step 206. Primary computer 28 determines whether an embedded reference was 
found in step 208. If not, the working copy is returned without any modifications 
as shown in step 210. If an embedded reference is discovered, the DEMAND MO 

2 5 subroutine is used to find the master object record for the embedded data file as 
shown in step 212. Next, the BEST SI subroutine is used to find the best instance 
for that master object as shown in step 214. Then, in the working copy of the data 
file, the reference to the original Web Server (i.e., an address code) and Data File 
names are replaced with the names of the web server and data file where the best 

30 instance is located a shown in step 216. If there is more in the data file to read, 
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then primary computer 28 will try to read the next embedded reference to a data 
file. Finally, the primary computer 28 transmits the contents of working file to the 
web browser that requested it as shown in step 210. 

5 Referring to Fig. 6, there is provided a preferred embodiment of the 

REWRITE subroutine shown as step 76 in Fig. 2, starting with step 122. Initially, 
the instance record is received, and the output file is initialized so that it is empty 
as shown in steps 124 and 126. The next segment of text is then read from the 
data file as shown in step 128. Then in step 130, primary computer 28 determines 

1 0 whether any data was read from the file. If not, the contents of the output file are 
returned as shown in step 132. However, if data was read from the file, then all 
non-tag text is copied to the page output buffer a shown in step 1 34. When a tag 
is encountered, primary computer 28 tries to read the next tag-only segment of the 
incident master object from storage as shown in step 1 36. Next, as shown in step 

15 1 38, the primary computer 28 determines whether data was read from the file. If 
not, then the contents of the output file are returned as shown in step 132. As 
shown in steps 138 and 140, if data was read from the file, the primary computer 
28 determines whether there is a reference to another data file. If none is found, 
the tag is copied to the output file without modification as shown in step 142. 

20 However, if the tag could contain such references, the DEMAND MO subroutine 
is used to find the master object record for the name found as shown in step 144. 
Next, the BEST SI subroutine is used to find the best instance for that master 
object as shown in step 146. Then, in the tag, the reference to the original Web 
Server and Data File names are replaced with the names of the web server and 

2 5 data file where the best instance is located a shown in step 148. The tag is then 
appended to the page output buffer as shown in step 1 50. If there is more in the 
data file to read, then primary computer 28 will try to read the next non-tag 
segment. Finally, the primary computer 28 transmits the contents of the page 
output buffer to the web browser that requested it as shown in step 132. 
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if 

Referring to Fig. 7, there is provided the POSTPROCESS subroutine 
shown as step 82 in Fig. 2, starting with step 1 54. Primary computer 28 updates 
the statistical database records for the instance, the master object, the server, and 
the communications links appropriate to the instance that was selected as shown 
5 in step 1 56. Then, the first replication server is obtained as shown in step 1 58, 
and a determination is made as to whether there is a server to consider as shown in 
step 160. If not, then subroutine POSTPROCESS returns to the main process of 
Fig. 1 . If a server to consider does exist, then a determination is made of whether 
there is already an instance of this master object on this server as shown in step 
10 162. If so, the next replication server is considered as shown in step 158. If there 
is not an instance of the master object on the server, primary computer 28 
determines whether the master object is larger than the total storage capacity of 
this server as shown in step 164. If so, the next replication server is considered a 
shown in step 158. 

15 

If the master object is not larger than the total storage capacity of the 
server, a list is made of the instances, if any, that would have to be deleted from 
this server in order to accommodate the master object under consideration as 
shown in step 166. The total replication value of the list and the replication value 
20 of the master object are computed as shown in steps 168 and 1 70. The total 
replication value of the list is compared to the replication value of the master 
object as shown in step 172. If the replication value of the given master object is 
not greater than the total replication value of the list, then the next replication 
server is examined as shown in step 158. If the replication value of the given 

2 5 master object is greater than the total replication value of the list, then the remote 
instances on the list are deleted. Specifically, elements of the list are deleted from 
the given server and from the instance database as shown in step 174, and a copy 
of the master object is made onto the given server and the record of instance 
master object is added to the instance database as shown in step 176. If all 

30 secondary webservers 16 are to be treated as one pooled storage area, then the 
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POSTPROCESS subroutine returns to the main process of Fig. 2 as shown in step 
178. If not, then more servers are examined as shown in step 158. 

The present invention having been described with particular reference to 
5 the preferred forms thereof, it will be obvious that various changes and 

modifications may be made therein without departing from the spirit and scope of 
the invention as defined in the appended claims. 
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WHAT IS CLAIMED IS : 

1 . An apparatus connected to a network, for receiving a request for data files 
from a network user and allocating transmission of the data files between 
predetermined devices connected to the network, said apparatus comprising: 
5 memory means for storing data files and at least one look-up table 

having specific criteria pertaining to said data files and said predetermined 
devices, said data files including dynamic data files and/or static data files; and 
data allocation means which is capable of: 

(i) receiving a request for specific data files, from a network 
10 user, 

(ii) looking up specific criteria in said look-up table pertaining 
to said specific data files, and 

(iii) allocating transmission of each specific data file between 
said predetermined devices based on said specific criteria; 

15 and 

means for transmitting data to said network user. 

2. The apparatus as recited in claim 1 , wherein said data allocation means 
causes said transmitting means to transmit any dynamic data file in said specific 

2 0 data files to said network user. 

3. The apparatus as recited in claim 1 , wherein said data allocation means 
selects an optimum device from said determined devices for each static data file in 
said specific data file and causes said network user to request said each static data 

2 5 file from a corresponding selected optimum device. 

4. The apparatus as recited in claim 3, wherein said data allocation means 
transmits an address code of the corresponding selected optimum device for each 
static data file to said network user. 

30 
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5. The apparatus as recited in claim 4, wherein said network user, responsive 
to said receipt of said address code for each static data file, requests each of said 
static data files from the corresponding selected optimum device 

5 6. The apparatus as recited in claim 1 , further comprising means for 

transferring a copy of any data file stored in said memory means to any of said 
predetermined devices. 

7. The apparatus as recited in claim 1, further comprising means for updating 
10 said specific criteria in said look-up table of said memory means. 

8. The apparatus as recited in claim 1, wherein said specific criteria include a 
latency of a connection between each of said predetermined devices and said 
network user. 

15 

9. The apparatus as recited in claim 1, wherein said specific criteria include 
an estimated cost of transmitting the data file from each of said predetermined 
devices to the network user. 

20 10. The apparatus as recited in claim 1 , wherein said specific criteria include a 
transmission bandwidth of each of said predetermined devices. 

1 1 . The apparatus as recited in claim 1 , wherein said specific criteria include a 
current capacity of each of said predetermined devices to transmit data files to said 

25 network user. 

12. The apparatus as recited in claim 1 , wherein said specific criteria include a 
version type of data files retrievable by each of said predetermined devices. 
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13. The apparatus as recited in claim 1, wherein at least one of said specific 
data files includes embedded references to additional data files. 

14. The apparatus as recited in claim 1, wherein said data allocation means is 
5 further capable of looking up specific criteria in said look-up table pertaining to 

said additional data files, and allocating transmission of each of said additional 
data files between said predetermined devices, based on said specific criteria 
pertaining to said additional data files. 

10 15. The server system as recited in claim 1 4, wherein said embedded 

references include corresponding address codes for each of said additional data 
files, said allocation means having means for rewriting said address codes. 

16. A collaborative server system for providing high speed data transmission 

1 5 of data files across a network, comprising: 

a primary server and a secondary server, both connected to a 
network across a communication medium, said primary server comprising: (a) 
memory means for storing data files and at least one look-up table having specific 
criteria pertaining to said data files and said primary server and said at least one 

20 secondary server, said data files including dynamic data files and/or static data 
files; (b) data allocation means which is capable of: (i) receiving a request for 
specific data files from a network user, (ii) looking up specific criteria in said 
look-up table pertaining to said specific data files, and (iii) allocating transmission 
of each specific data file between said primary server and said at least one 

2 5 secondary server based on said specific criteria; and (c) means for transmitting 
data across said network, 

said at least one secondary server comprising: secondary memory 
means for storing data files, said data files including dynamic data files and/or 
static data files, and means for transmitting data across said network. 
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17. The server system as recited in claim 16, wherein said data allocation 
means causes said transmitting means of said primary server to transmit any 
dynamic data file in said specific data files to said network user. 

5 18. The server system as recited in claim 1 6, wherein said data allocation 
means selects an optimum server from a group including said primary server and 
said secondary server for each static data file in said specific data file and causes 
said network user to request said each static data file from a corresponding 
selected optimum device. 

10 

1 9. The server system as recited in claim 1 8, wherein said data allocation 
means transmits an address code of the corresponding selected optimum device 
for each static data file to said network user. 

1 5 20. The server system as recited in claim 1 9, wherein said network user, 
responsive to said receipt of said address code for each static data file, requests 
each of said static data files from the corresponding selected optimum device 

21 . The server system as recited in claim 1 6, wherein said primary server 

2 0 further comprises means for transferring a copy of any data file stored in said 

primary memory means to said secondary memory means of said at least one 
secondary server. 

22. The server system recited in claim 16, wherein said primary server further 
25 comprises means for updating said specific criteria in said look-up table of said 

primary memory means. 

23. The server system as recited in claim 16, wherein said specific criteria 
include a latency of a connection between said primary server and said at least one 

3 0 secondary server and said network user. 
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24. The server system as recited in claim 1 6, wherein said specific criteria 
include an estimated cost of transmitting the data file from said primary server and 
said at least one secondary server to the network user. 

5 

25. The server system as recited in claim 16, wherein said specific criteria 
include a transmission bandwidth of said primary server and said at least one 
secondary server. 

10 26. The server system as recited in claim 16, wherein said specific criteria 
include a current capacity of said primary server and said at least one secondary 
server to transmit data files to said network user. 

27. The server system as recited in claim 1 6, wherein said specific criteria 

1 5 include a version type of data files retrievable by said primary server and said at 
least one secondary server. 

28. The server system as recited in claim 1 6, wherein at least one of said 
specific data files includes embedded references to additional data files. 

20 

29. The server system as recited in claim 28, wherein said data allocation 
means is further capable of: looking up specific criteria in said look-up table 
pertaining to said additional data files, and allocating transmission of each of said 
additional data files between said predetermined devices, based on said specific 

2 5 criteria pertaining to said additional data files. 

30. The server system as recited in claim 29, wherein said embedded 
references include corresponding address codes for each of said additional data 
files, said data allocation means having means for rewriting said address codes. 

30 
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3 1 . The server system as recited in claim 16, wherein said primary server is 
connected to the network across a slower transmission medium than said at least 
one secondary server. 

5 32. The server system as recited in claim 3 1 , wherein said primary server is 
connected to the network across an analog transmission line and said at least one 
secondary server is connected to the network across a faster digital transmission 
line. 

10 33. The server system as recited in claim 16, wherein said network user 
includes means for requesting and receiving data files, via the network. 

34. The server system as recited in claim 33, wherein said network user further 
includes means for generating a web page from said specific data files. 

15 

35. The server system as recited in claim 16, wherein said network user and 
said primary server are connected in a local area network (LAN), said LAN being 
connected to said network. 

2 0 36. The server system as recited in claim 1 6, wherein said network user and 
said at least one secondary server are connect to a local are network (LAN), said 
LAN being connected to said. 

37. An apparatus connected to a network, for receiving a request for data file 
25 having embedded therein references to additional data files from a network user 

and allocating transmission of all of the data files between predetermined devices 

connected to the network, said apparatus comprising: 

memory means for storing data files and at least one look-up table 

having specific criteria pertaining to said data files and said predetermined 
30 devices, said data files including dynamic data files and/or static data files; and 
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data allocation means which is capable of: 

(i) receiving a request for at least one specific data file having 

embedded therein references to additional data files, from a 

network user, 

5 (ii) looking up specific criteria in said look-up table pertaining 

to each of said additional files, and 
(iii) allocating transmission of each of said at least one specific 
data file and each of said additional data files between said 
predetermined devices based on said specificcriteria; and 
1 0 means for transmitting data to said network user. 

38. The apparatus as recited in claim 37, wherein said embedded references 
include corresponding address codes for each of said additional data files, said 
data allocation means having means for rewriting said address codes. 
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